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FIXED-DIMENSIONAL ENERGY GAMES ARE IN 
PSEUDO-POLYNOMIAL TIME 

MARCIN JURDZINSKI, RANKO LAZIC, AND SYLVAIN SCHMITZ 


Abstract. We generalise the hyperplane separation technique (Chat- 
terjee and Velner, 2013) from multi-dimensional mean-payoff to energy 
games, and achieve an algorithm for solving the latter whose running 
time is exponential only in the dimension, but not in the number of ver¬ 
tices of the game graph. This answers an open question whether energy 
games with arbitrary initial credit can be solved in pseudo-polynomial 
time for fixed dimensions 3 or larger (Chaloupka, 2013). It also im¬ 
proves the complexity of solving multi-dimensional energy games with 
given initial credit from non-elementary (Brazdil, Jancar, and Kucera, 
2010) to 2EXPTIME, thus establishing their 2EXPTIME-completeness. 
Key Words. Energy game, bounding game, Erst-cycle game, vector 
addition system with states 


1. Introduction 

Multi-Dimensional Energy Games are played turn-by-turn by two players 
on a finite multi-weighted, game graph, whose edges are labelled with integer 
vectors modelling discrete energy consumption and refuelling. Player l’s ob¬ 
jective is to keep the accumulated energy non-negative in every component 
along infinite plays. This setting is relevant to the synthesis of resource- 
sensitive controllers balancing the usage of various resources like fuel, time, 
money, or items in stock, and finding optimal trade-offs; see [a nn in ns] 
for some examples. Maybe more importantly, energy games are the key in¬ 
gredient in the study of several related resource-conscious games, notably 
multi-dimensional mean-payoff games [7] and games played on vector addi¬ 
tion systems with states (VASS) |J, I] ill . 

The main open question about these games is to pinpoint the complexity 
of deciding whether Player 1 has a winning strategy when starting from a 
particular vertex and given an initial energy vector as part of the input. 
This particular given initial credit variant of energy games is also known 
as Z-reachability VASS games mM- The problem is also equivalent via 
logarithmic-space reductions to deciding single-sided VASS games with a 
non-termination objective [I], and to deciding whether a given VASS (or, 
equivalently, a Petri net) simulates a given finite state system [T2l [HI )9l[2j. 
As shown by Brazdil, Jancar, and Kucera [3], all these problems can be 
solved in ( d — 1)EXPTIME where d > 2 is the number of energy components, 
i.e. a TOWER of exponentials when d is part of the input. The best known 
lower bound for this problem is 2EXPTIM E-hardness [9], leaving a substantial 
complexity gap. So far, the only tight complexity bounds are for the case 
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d = 2: Chaloupka [5] shows the problem to be P-complete when using unit 
updates, i.e. when the energy levels can only vary by —1, 0, or 1. However, 
quoting Chaloupka ‘since the presented results about 2-dimensional VASS 
are relatively complicated, we suspect this [general] problem is difficult.’ 

When inspecting the upper bound proof of Brazdil et al. [3], it turns out 
that the main obstacle to closing the gap and proving 2EXPTIM E-completeness 
lies in the complexity upper bounds for energy games with an arbitrary ini¬ 
tial credit —which is actually the variant commonly assumed when talking 
about energy games. Given a multi-weighted game graph and an initial ver¬ 
tex v, we now wish to decide whether there exists an initial energy vector b 
such that Player 1 has a winning strategy starting from the pair (u,b). As 
shown by Chatterjee, Doyen, Henzinger, and Raskin [7], this variant is sim¬ 
pler: it is coNP-complete. However, the parameterised complexity bounds 
in the literature PE] for this simpler problem involve an exponential de¬ 
pendency on the number \V\ of vertices in the input game graph, which 
translates into a tower of exponentials when solving the given initial credit 
variant. 


Contributions. We show in this paper that the arbitrary initial credit prob¬ 
lem for d-dimensional energy games can be solved in time (|R| • ||i?||) 0 G 4 ) 
where |Vj is the number of vertices of the input multi-weighted game graph 



thus closing the gap left open in mm- Our parameterised bounds are of 
practical interest because typical instances of energy games would have small 
dimension but might have a large number of vertices. 


By the results of Chatterjee et al. 0, another consequence is that we 
can decide the existence of a finite-memory winning strategy for fixed¬ 
dimensional mean-payoff games in pseudo-polynomial time. The existence 
of a finite-memory winning strategy is the most relevant problem for con¬ 
troller synthesis, but until now, solving fixed-dimensional mean-payoff games 
in pseudo-polynomial time required infinite memory strategies [6j. 


Overview. We prove our upper bounds on the complexity of the arbitrary 
initial credit problem for d-dimensional energy games by reducing them to 
bounding games, where Player 1 additionally seeks to prevent arbitrarily high 
energy levels (Section 2.3). We further show these games to be equivalent 
to first cycle bounding games in Section 6} where the total effect of the first 
simple cycle defined by the two players determines the winner. More pre¬ 
cisely, first-cycle bounding games rely on a hierarchically-defined colouring 
of the game graph by perfect half-spaces (see Section 5), and the two players 
strive respectively to avoid or produce cycles in those perfect half-spaces. 

First-cycle bounding games coloured with perfect half-spaces can be seen 
as generalising quite significantly both 

• the ‘local strategy’ approach of Chaloupka 0 for 2-dimensional en¬ 
ergy games, and 
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Figure 1. A 2-dimensional multi-weighted game graph. 


• the ‘separating hyperplane technique’ of Chatterjee and Velner [6] for 
multi-dimensional mean-payoff games; see Section 4 for an overview 
of the latter approach. 

The reduction to first cycle bounding games has several important corol¬ 
laries: the determinacy of bounding games, and the existence of a small 
hypercube property , which in turn allow to derive the announced complexity 


bounds on energy games (see Section 3). In fact, we found with first-cycle 


bounding games a highly versatile tool, which we use extensively in our 
proofs on energy games. 

We start by presenting the necessary background on energy and bounding 
games in Section 2 Some omitted material on linear algebra can be found 


in Appendix A 


2. Multi-Weighted Games 


We define in this section the various games we consider in this work. 
We start by defining multi-weighted game graphs, which provide a finite 
representation for the infinite arenas over which our games are played. We 
then define energy games in Section 2.2, and their generalisation as bounding 
games in 


Section 2.3 


2.1. Multi-Weighted Game Graphs. We consider game graphs whose 
edges are labelled by vectors of integers. They are tuples of the form 
( V,E,d ), where d is the dimension in N, V '= V\ td Vi is a finite set of 
vertices, which is partitioned into Player 1 vertices (Vi) and Player 2 ver¬ 
tices (V 2 ), and FI is a finite set of edges included in V x 7L d x V, and such 
that every vertex has at least one outgoing edge; we call the edgelabels in 
Z d ‘weights’. 

Example 2.1. Figure [l] shows on its left an example of a 2-dimensional multi- 
weighted game graph. Throughout this paper, Player 1 vertices are depicted 
as triangles and Player 2 vertices as squares. □ 


2.1.1. Norms. For a vector a, we denote the maximum absolute value of 
its entries by ||a|| '= maxi<j<^ |a(i)|, and we call it the norm of a. By 

extension, for a set of edges E, we let ||i?|| '= max(„ )U y) GE ||u||. We assume, 
without loss of generality, that ||F?|| > 0 in our multi-weighted game graphs. 
Regarding complexity, we encode vectors of integers in binary, hence ||i£|| 
may be exponential in the size of the multi-weighted game graph. 
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2.1.2. Paths and Cycles. Given a multi-weighted game graph (V,E,d), a 
configuration is a pair (u, a) with v in V and a in ifi. A path is a finite 
sequence of configurations tt = (vq, ao)(ui, ai) • • • (v n , a n ) in (V x Z rf )* such 
that for every 0 < j < n there exists an edge (vj, aj + i — aj,Vj + i) in E (where 
addition is performed componentwise). The total weight of such a path n is 

w ( 7r ) = So <j<n a i+i ~ a j = a n ~ a o- 

A cycle is a path (vq, ao)(ui, ai) • • • (v n . a n ) with vq = v n . Such a cycle is 
simple if whenever vj = vy for some 0 < j < j' < n, then j = 0 and j' = n. 
We assume, without loss of generality, that every cycle contains at least 
one Player 1 vertex. We often identify simple cycles with their respective 
weights; the weights of the four simple cycles of the game graph in Figure l] 
are displayed on its right. 


Proposition 2.2. In any game graph (V,E,d), the total weight of any sim¬ 
ple cycle has norm at most |V| ■ ||.E||. 


2.1.3. Strategies and Plays. Let vq be a vertex from V. A play from vo is 
an infinite configuration sequence p = (vq, ao)(ui, ai) • • • such that ao = 0 
is the null vector and every finite prefix p\ n = ! (uo, ao) • • • (v n , a n ) is a path. 
Note that, because ao = 0, the total weight of this prefix is w(p\ n ) = a n . We 
define the norm of a play p as the supremum of the norms of total weights 
of its prefixes: ||p|| = f sup n ||u;(p| n )||. 

A strategy for Player p, p e {1,2}, is a function a p taking as input a 
non-empty path 7r • (v, a) ending in a Player p vertex v E V p , and returning 
an edge a p (i r • (u,a)) = (v,u,v') from E. A play p = (vq, ao)(ui, ai) • • • is 
consistent with a strategy a v for Player p if whenever v n is a Player p vertex 
in V p , then cr p (p\ n ) = (v n , a n+ i — a n ,v n+ i). Given two strategies a\ and (J 2 
for Player 1 and Player 2 respectively, and an initial vertex uo in V, observe 
that there is a unique play p VOjai ,a 2 f° r ^0 which is consistent with both o\ 
and cr 2 . 


Example 2.1 (continued). For instance, in the game graph depicted in Fig¬ 


ure 1, a strategy for Player 1 could be to move to vl whenever the current 


energy level on the first coordinate is non-negative, and to vr otherwise: 


oq(vr • (u 0 , a)) = 


(^o, (0,0) ,v L ) if a(l) > 0, 
(vq, (0,0), vr) otherwise, 


(1) 


and one for Player 2 could be to always select one particular edge in every 
vertex, regardless of the current energy vector—this is called a counterless 
strategy [3]—: 


/ t def J (v L , (-2,2) ,u 0 ) ifv = v L 
CT 2 (vr • (v, a)) ^ ^ ; ot h erw i se . 


( 2 ) 


These strategies define a unique consistent play for vq, which starts with 

{v 0 , 0, 0)(ul, 0,0)(u 0 , -2, 2 )(v R , -2, 2)(v 0 , 2, -l)(v L , 2, -l)(wo, 0,1). (3) 

□ 


In the following we consider several different winning conditions on plays, 
which define different games played on multi-weighted game graphs. 
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Figure 2. Self-covering strategy tree for Player 1 in the 


energy game of Figure 1 


2.2. Multi-Dimensional Energy Games. Suppose (V,E,d) is a multi- 
weighted game graph, vo an initial vertex, and b is a vector from N d . A 
play p from vq is winning for Player 1 in the energy game Ab (V,E,d) with 
initial credit b if, for all n, b + w(p\ n ) > 0, using the product ordering over 
7L d . Otherwise, Player 2 wins the play. As usual, this means that Player 1 
wins the energy game Ab(V, E, d ) in vo if there exists a winning strategy o\ 
for Player 1, i.e. cr\ is such that for all strategies 02 for Player 2 the play 
Pv 0 ,a-i,o- 2 is winning for Player 1. An immediate property of energy games is 
monotonicity, if a± is winning for Player 1 with some initial credit b, and 
b' > b, then it is also winning for Player 1 with initial credit b'. 


Example 2.1 (continued). For example, one may observe that the strat¬ 
egy @ for Player 1 is winning for the game graph of |Figure l| with initial 
credit (2,1) (or larger). A geometric intuition comes from the directions 
of the total weights of simple cycles in Figure f] by choosing alternatively 
edges to vl or vr, Player 1 is able to balance the energy levels above the 
‘x + y = O’ line. One way to see this more formally is to build the corre¬ 
sponding self-covering strategy tree up to the first time when a configuration 
is greater or equal to another configuration higher in the tree [3]. By mono¬ 
tonicity of the game, Player 1 can then repeat the same actions from those 
leaves. See Figure 2 for our example. 

Strategy o\ uses the comparison of a(l) with 0 as a soft bound to trigger 
a change of strategy and attempt to forbid cycles with a negative effect 
on the first coordinate. Note that the energy level a(l) might nevertheless 
become less than 0, but will remain > —2 at all times; we call this the hard 
bound. This follows the general scheme of Chaloupka [5]— and also ours—for 
Player 1 strategies. □ 


Example 2.3. As a rather different example, consider the multi-weighted 


game graph of Figure 3 Although Player 2 does not control any vertex, he 


wins the associated energy game for any initial credit b. 

To see this, consider any infinite path and let £ n ,r n and c n denote the 
numbers of times the left-hand-side self-loop, the right-hand-side self-loop, 
and the central simple loop are closed, respectively, in the first n steps. Note 
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Figure 3. A 2-dimensional game graph with only Player 1 
vertices. 


that the energy levels after the first n steps are (r n — l n ) — c n in the first 
component and — (r n — £ n ) — c n in the second component. If c n is bounded 
then clearly either r n —l n or —( r n —i n ) are not bounded from below, because 
one of i n and r n is bounded and the other is not. Consider the other case— 
when c n is not bounded—and for the sake of contradiction assume that both 
energy levels are bounded from below, and hence there is an initial credit 
b such that (r n — £ n ) — c n > b(l) and —(r n — £ n ) — c n > b(2). This is 
equivalent to r n — £ n > c n + b(l) and — ( r n — i n ) > c n + b(2). Since c n is not 
bounded, it follows that eventually both r n — i n and — (r n — £ n ) are always 
positive, which is absurd. □ 


2.3. Multi-Dimensional Bounding Games. A generalisation of energy 
games sometimes considered in the literature is to further impose a maximal 
capacity c e (also called an upper bound) on the energy levels during 
the play nu nsi- Player 1 then wins a play p if 0 < b + u;(p| n ) < c for all ra¬ 
in the spirit of the arbitrary initial credit variant of energy games, we also 
quantify c existentially. This defines the bounding game T(V,E,d ) over a 
multi-weighted game graph (V, E, d ), where a play p is winning for Player 1 
if its norm ||p|| is finite, i.e. if the set {||io(/o| n )|| : n G N} of norms of total 
weights of all finite prefixes of p is bounded, and Player 2 wins otherwise, 
if it is unbounded. In other words, Player 1 strives to contain the current 
vector within some d-dimensional hypercube, while Player 2 attempts to 
escape. 

For example, note that Player 2 is now winning the bounding game de¬ 
fined by the game graph of Figure 1, because he can ensure that the total 
energy Xu<i<d a n(*) increases with n along any play p = (u 0 , a 0 )( ui,a 1 ) ■ ■ ■ , 
hence the norm ||p|| is unbounded. Regarding the game graph of Figure 3 


since Player 2 wins the energy game for all initial credits, he has a uniform 
counterless winnning strategy, which is winning for all initial credits [3{ 
Lemma 19], and thus also winning in the bounding game. 


2.3.1. Lossy Game Graphs. As hinted above, if Player 1 wins the bounding 
game, then there exists some initial credit for which she also wins the en¬ 
ergy game. For a converse, let lossy(V, E, d) denote the lossy multi-weighted 
game graph obtained from (V, E, d) by inserting, at each Player 1 vertex and 
for each 1 < i < d, a self-loop labelled by the negative unit vector — e*. In a 
bounding game played over a lossy game graph, it turns out that Player 1 
can always bound the current vector from above by playing these unit decre¬ 
ments, hence she only has to ensure that the current vector remains bounded 









FIXED-DIMENSIONAL ENERGY GAMES ARE IN PSEUDO-POLYNOMIAL TIME 7 


from below, i.e. she has to win the energy game for some initial credit. For¬ 
mally: 

Proposition 2.4. From any vertex in any multi-weighted game graph (V, E, d ) 

(1) Player 1 wins the energy game Ab (V,E,d) for some b e N d if and 
only if Player 1 wins the bounding game T(\ossy(V,E,d)). 

(2) Player 2 wins the energy game Ab(V, E, d) for all b G N d if and only 
if Player 2 wins the bounding game r(lossy (V,E,d)). 


Proof sketch. We rely for this proof on results proven later in Section 6 By 
the determinacy of single-cycle bounding games and their equivalence with 
infinite bounding games, it suffices to prove the two ‘only if’ directions. 

For item 1[ assume that Player 1 has a winning strategy a i in the bounding 
game on lossy (V, E, d). In particular, there exists a bound B such that 
Player 1 can maintain ||p|| < B on any play p consistent with eri (an explicit 
such B is provided by |Lemma 3.1 ). Observe that along any consistent play, 
(ji must always eventually play an edge which is not a decrementing self¬ 
loop, or the play would eventually become unbounded from below. Hence 
the following strategy a[ which simply skips all the decrement actions is 
well-defined: 

(v,u,v') 


a[(n(v, a)) d A f <( 


if a)) = (v, u, v') with v / v' 

or VI < i < d . u — e,, 

<7^(tt(i7, a)(u, a — e*)) otherwise, i.e. if there is 1 < i < d 

s.t. ai(n(v, a)) = (u,-e^u). 

(4) 

By nronotonicity of energy games, this strategy is winning in the energy 
game on (V, E, d) with initial credit B on all components. 

For item 2[ assume that Player 2 has a winning strategy in the infinite 
bounding game on lossy(V, E. d). Then he has a winning strategy <72 in the 
first-cycle bounding game on lossy(V, E, D), which only plays colours that 
are disjoint from the non-negative orthant cone(ei,..., e^), because Player 1 
could otherwise use a decrementing self-loop to create a cycle outside this 
colour. Observe that, for any open half-subspace H that is disjoint from 
the non-negative orthant, and for any bound B, there exists a bound on 
the distances between the boundary of H and vectors in H whose every 
component is at least —B. It follows that the corresponding winning strategy 


<72 of Player 2 in the infinite bounding game (cf. Lemma 6.2) is also winning 
in the energy game Ab (V,E,d), for all initial credits b (cf. Claim 6.3). □ 


Our task in the following will be therefore to prove an upper bound on 
the time complexity required to solve bounding games. 


3. Complexity Upper Bounds 

Our main results are new parameterised complexity upper bounds for 
deciding whether Player 1 has a winning strategy in a given energy game. 
In turn, we rely for these results on a small hypercube property of bounding 
games, which we introduce next, and which will be a consequence of the 


study of first cycle bounding games in Section 6 
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3.1. Small Hypercube Property. In a bounding game, if Player 1 is 
winning, then by definition she has a winning strategy (J\ such that for all 
plays p consistent with or there exists some bound B p with ||p|| < B p . We 
considerably strengthen this statement in Section 6] where we construct an 
explicit winning strategy, which yields an explicit uniform bound B for all 
consistent plays: 


Lemma 3.1. Let ( V , E. d ) be a multi-weighted game graph. If Player 1 wins 
the bounding game r(V, E, d), then she has a winning strategy which ensures 

M < m\ ■ ii£ii ) 2( ' i+2)3 

for all consistent plays p. 


Note that our bound is polynomial in \ V\ the number of vertices, unlike 
the bounds found in comparable statements by Brazdil et al. [3} Lemma 7] 
and Chatterjee et al. |8j Lemma 3], which incur an exponential dependence 
on |P|. This entails pseudo polynomial complexity bounds when d is fixed: 


Corollary 3.2. Bounding games on multi-weighted graphs (V, E , d) are solv¬ 
able in deterministic time (|P| • ||i?||) 0 ^ 4 ' ) . 


Proof. By |Lenmia 3.1, the bounding game is equivalent to a reachability 


game where Player 2 attempts to see the norm of the total weight exceed 
B = f (4|P| • ||p;||) 2 ( d + 2 h l . This can be played within a finite arena of size 
(B + l) rf and solved in time linear in that size using the usual attractor 
computation algorithm. □ 


3.2. Energy Games with Arbitrary Initial Credit. The arbitrary ini¬ 
tial credit problem for energy games takes as input a multi-weighted game 
graph and an initial vertex vq and asks whether there exists a vector b in 
N d such that Player 1 wins Ab(P, E, d) in uq: 


Theorem 3.3. The arbitrary initial credit problem for energy games on 
multi-weighted game graphs ( V,E,d ) is solvable in deterministic time (|P| • 
\\E\\)°( di) . 


Proof. This follows from Proposition 2.4 and Corollary 3.2 applied to the 
game graph T(lossy (V, E, d,)). □ 


3.3. Energy Games with Given Initial Credit. The given initial credit 
problem for energy games takes as input a multi-weighted game graph ( V, E, d), 
an initial vertex vo, and a credit b in H d and asks whether Player 1 wins 
the energy game A^(V,E,d) in vo'- 

Theorem 3.4. The given initial credit problem with credit b for energy 
qames on multi-weiqhted game graphs (V, E, d) is solvable in deterministic 
time (\V\ ■ \\E\\ ■ (\\b\\ + l)) 2 ° (d ' loed) . 


This is again a pseudo-polynomial upper bound when d is fixed, and encom¬ 
passes Chaloupkas P upper bound in dimension d = 2 with unit updates, 
i.e. with 11EH = 1. Because the given initial credit problem for energy games 
of fixed dimension d > 4 is EXPTIME-hard [9], the bound in terms of ||E|| in 


Theorem 3.4 cannot be improved. 
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Thanks to Lemma 3.1 [ a proof of Theorem 3.4 could be obtained by 
bounding on the height of self-covering strategy trees, using the work of 
Brazdil et al. [3j, and more generally the techniques of Rackoff [15] . We 
sketch an arguably simpler proof next, which does not explicitly mention 
self-covering trees. Throughout the remainder of this section, we let (V, E , d ) 
denote some multi-weighted game graph, v be in V and b in N rf . 


3.3.1. Fenced Games. The main difference between energy games with a 
given initial credit b and those with arbitrary initial credit is that Player 2 
wins the former as soon as the current total weight goes below — b(i) on some 
coordinate i. We can enforce a similar condition on bounding games. Given 
I C {1,... ,d}, we say that Player 2 wins a play p in the fenced bounding 
game r^U, E,d) if 

• there exists n in N and i in I such that (b + w(p\ n ))(i) < 0, or 

• ||p|| = lo as in the usual bounding game, 

and Player 1 wins otherwise. Observe that the fenced bounding game 
r^(V, E,d) is simply the bounding game T(V,E,d). Using the fact that 
fenced games are equivalent to bounding games played on double exponen¬ 
tially larger arenas (as we sha ll see next in|§3.3.2 ), we can essentially repeat 
the arguments of the proof of Proposition 2.4 and obtain: 


Claim 3.5. For both p £ {1,2}, Player p wins the energy game Ab (V,E,d) 
from vertex v iff Player p wins the fenced bounding game r| ) 1 ’"'’^(lossy(U, E, d)) 
from vertex v. 


3.3.2. Capped Game Graphs. Fenced bounding games allow to enforce the 
given initial credit of an energy game through semantic means, by modifying 
the winning conditions of bounding games. We now show that the same 
effect can be obtained syntactically, by considering a bounding game played 
over a different game graph. 

Consider for this a coordinate 1 < i < d. The idea in the following is to 
track the current value of w(p\ n )(i) in the vertices of the game graph, and 
go to a Player 2-winning vertex v± if it ever goes below —b(i). The issue 
with that plan is that w(p\ n )(i) might grow unboundedly high during the 
course of a play. However, by |Lemma 3.1[ we know that if Player 2 manages 
to force this value above 

B = (4\V\ ■ \\E\\)^ d+2 ^ , (5) 

then he also wins the game. 

We construct accordingly a capped d-dimensional multi-weighted game 
graph cappedb (V,E,d) from ( V,E,d ) as follows: 

• its set of Player 1 vertices is {nj_} l±J V\ x {—b(i),..., B} and its set 
of Player 2 vertices is V 2 x {-b(*),...,S}; 

• for any vertex ( v,a ), and for any edge (v,u,v') in E with — b(i) < 
a + u(i) < B , it has an edge (( v , a), u, (v', a + u(i))), 

• for any vertex (v , a), if there is an edge (v , u, v') in E with a + u (i) < 
— b(i) or a + u(i) > B, it has an edge (( v , a), u, u_l); 

• finally v± has only a self-loop labelled by some unit vector, and is 
therefore losing for Player 1. 
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Claim 3.6. For any I C {1, ..., d}, i fL I, and all p £ {1, 2}, Player p wins 
the fenced bounding game r[, U ^(V, E, d) from vertex v iff Player p wins the 
fenced bounding game r b (capped^(C, E, d)) from vertex (u,0). 


Proof of \Theorem S.J\ For any v £ V and p £ {1,2}, using Claim 3.5 and 
using [Claim 3.6 successively for i = 1,..., d, Player p wins the energy game 
A b (V, E. d) from vertex v iff Player p wins the fenced bounding game 


Pb (capped^ (■ • • cappedb(lossy(V, E, d)) ■ • •)) (6) 

from (v. 0 ), which is the same as the bounding game 

P^cappedb(- • • cappedb(lossy(V, E, d)) ■ ■ ■ )^ . (7) 

As capping preserves the norm of edge labels and the dimension, all we 
need to do is compute the number of vertices in that game graph. Let 
A t be the number of vertices in capped}, (• • • cappedb(lossy(V, E, d)) ■ ■ ■ ) and 

Ao == |V|; by ([5]), we see that 

Ai +1 = 1+Ai (b(i + 1) + (471,11 E\\ ) 2(d+2 ) 3 ) < (4Ai||£'||(||b|| + i))2(«H-2) 3 +i . 

(8) 

We show by induction over i that 

4A,||-E||(||b|| + 1) < (4|V| • ||77||(||b|| + l)) 2l ( rf + 3 ) Sl . (9) 


This holds for i = 0, and for the induction step, 


4A m p||(||b|| + 1) <4(4^11^11(1^11 + l)) 2(d+2)3+1 \\E\\(\\b\\ + 1) by (§ 

< 4(4|V| • ||^||(||b|| + i))2 i (d+ 3 ) 3i -( 2 (<i+2) 3 + 1 )|| jE ;||(||b|| + 1) 

< (4|V| • ||F/||(||b|| + i)) 2i (rf+3) 3l -(2(<i+2) 3 +i)+i 

<(4|l/M|£||(||b|| + l)) 2 ‘"' J + 3 > ,,,+1 ’. 


We conclude using |Corollary 3.2| 


□ 


Remark 3.7 (Pareto Limit). Another consequence of our bounds is that we 
can effectively compute the set of minimal initial credits that allow Player 1 
to win a given energy game, sometimes called the Pareto limit of the game. 
By Proposition 2.4 and Lemma 3.1, these vectors have a norm bounded by 
an exponential value in d but polynomial in |V| and ||F/||, and we can use 


Theorem 3.4 to test for each such vector whether it suffices for Player 1 to 
win the game. This algorithm works in double exponential time when d is 
part of the input but pseudo-polynomial time when d is fixed. □ 


4. Multi-Dimensional Mean-Payoff Games 

This section summarises the technique for solving multi-dimensional mean- 
payoff games proposed by Chatterjee and Velner [6], which relies on open 
half-spaces. The rest of the paper does not rely formally on this section 
and it may be omitted by a reader eager to get on with our new ‘perfect 
half-spaces’ technique for solving multi-dimensional bounding games. We 
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believe, however, that starting here helps put our work in context, appreci¬ 
ate similarities and differences between the two techniques, and understand 
the conceptual and some of the technical challenges we had to overcome. 

4.1. Multi-Dimensional Mean-Payoff Games. Given a play p over a 

multi-weighted game graph (V,E,d), we define its long-term average in Q d 
as avg(p) = lim, woo We say that p is winning for Player 1 in the 

mean-payoff game M(V,E,d ) if avg(p) > 0. Otherwise, i.e. if there is a 
coordinate 1 < i < d such that avg(/?)(i) < 0, the play p is winning for 
Player 2. As shown by Chatterjee et al. [7|, determining the winner in multi¬ 
dimensional mean-payoff games is coNP-complete, and in pseudo-polynomial 
time when the dimension is fixed |6j Theorem 1]. 

4.2. Energy Versus Mean-Payoff. In a one-dimensional arbitrary-initial- 
credit energy game, the goal of Player 1 is to keep the energy level bounded 
from below. It is folklore that Player 1 has a winning strategy in such a 
game if and only if she has a strategy in the mean-payoff game on the same 
game graph that guarantees a non-negative long-term average. 


4.2.1. Infinite Memory Strategies for Player 1. This relationship between 
energy games and mean-payoff games does not generalise to multi-dimensional 
games. We illustrate this on the example of a 2-dimensional game graph 
from Figure 3 In Example 2.3| we have argued that Player 2 has a winning 
strategy in the bounding game (and hence also in the arbitrary-initial-credit 
energy game). On the other hand, we argue that Player 1 has a strategy 
to guarantee that the long-term average in both dimensions is non-negative. 
Indeed, consider a strategy in which in stage m —for all m = 1,2,3,...— 
Player 1 performs one of the self-loops m times, then she moves to the 
other vertex where she performs the other self-loop m times, and then fi¬ 
nally returns to the starting vertex. After m stages, the energy level in 
both dimensions is — m and the number of steps performed is 0(m 2 ), hence 
the long-term average in the infinite play is 0 in both dimensions, because 


llTTl — TfL — Q 

n m ra—>oo m 2 — 


Note that this strategy for Player 1 in the game graph of Figure 3 


is 


infinite-memory, since the actions depend on the stage m. Multi-dimensional 
mean-payoff games might require infinite memory in order to be won, as 
shown by Chatterjee et al. [3 Lemma 4]—their proof can be used to show 
that the game of Figure 3] actually requires infinite memory. 


4.2.2. Finite Memory Strategies for Player 1. In the multi-dimensional case, 
there is nevertheless a strong relation between energy and mean-payoff games. 
Call a strategy a finite memory if there exists an equivalence relation ~ with 
finite index over ( V x r L d ) + such that, whenever ir ~ ir' for some non-empty 
paths 7r and if in the domain of a, then <r(7r) = ccfn') (such strategies are 
typically described using Moore machines). 

Fact 4.1 (Chatterjee et al. [7]). Let (V,E,d) be a multi-weighted game 
graph. There exists an initial credit b such that Player 1 wins the energy 
game Ab (V, E, d) if and only if Player 1 has a finite memory winning strat¬ 
egy in the mean-payoff game M(V, E. d). 
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Hence our complexity bounds in |Theorem 3.3| on multi-dimensional energy 
games also yield a pseudo-polynomial time algorithm to find a winning finite- 
memory strategy for Player 1 in a given fixed-dimensional mean-payoff game. 


4.3. The Open Half-Space Technique for Mean-Payoff Games. For 

technical convenience, we follow Chatterjee and Velner in considering mean- 
payoff games on lossy game graphs (see Section 2.3 for a formal definition 
of lossy game graphs). In this context, the goal of Player 1 is to achieve 
a long-term average of 0 in all dimensions, and the goal of Player 2 is to 
achieve a negative long-term average in at least one dimension. 


4.3.1. Winning Strategies for Player 2. The first key observation that under¬ 


is a vertex vq, an open half-space H C M. d and a strategy for Player 2 that 
guarantees all simple cycles formed along a play from uo to be in H. One can 
then argue that if Player 2 uses such a strategy indefinitely then the norms 
of the energy level vectors grow linearly in the number of steps performed, 
and hence the long-term average is non-zero in at least one dimension. 

Observe that every open half-space can be determined by a non-zero vec¬ 
tor n E that is normal to the hyperplane on the boundary of the half¬ 
space: 

H n = {v E R d : n • v < 0} . 

Chatterjee and Velner [6] Lemma 1] crucially point out that for every vec¬ 
tor n E one can check whether Player 2 has a strategy that guarantees 
all simple cycles formed to be in H n from uo by solving a one-dimensional 
mean-payoff game on the multi-weighted game graph with every weight u 
replaced by the dot-product n ■ u. 


pins the solution of (lossy) multi-dimensional mean-payoff games by Chat¬ 


terjee and Velner is the following sufficient condition for the existence of a 


winning strategy for Player 2 from some vertex in the game graph: there 


4.3.2. Winning Strategies for Player 1. The second key insight of Chatterjee! 
land Velnerl is that the above sufficient condition for the existence of a win¬ 
ning strategy for Player 2 in a lossy multi-dimensional mean-payoff game is 
necessary. Indeed, by (positional) determinacy of mean-payoff games [TO] , it 
follows that, if the sufficient condition described above does not hold for any 
open half-space H n and any initial vertex vo, then for all non-zero vectors n 
and all vertices vq, Player 1 has a (positional) strategy to block simple cycles 
in H n along any play from vo, or in other words to force all simple cycles 
formed to be in 


R d \H n = H_ n = {v E R d : n ■ v > 0} . 

In such a case, Chatterjee and Velner [6] Lemma 2] show that such strate¬ 
gies of Player 1, which force simple cycles formed to be in any closed half¬ 
space, can be carefully combined to ensure that the long-term average is 0 
in every dimension. The main idea in the construction of the strategy for 
Player 1 is to proceed in stages m = 1, 2, 3,..., to monitor the energy-level 
vector at the beginning of stage rn of the game, say g m , and to ‘counter¬ 
act’ its further growth in the direction of g m throughout stage m by using 
the strategy that blocks simple cycles in the open half-space H Sm , i.e., that 
forces all the formed simple cycles to be in the closed half-space iL_ gm . In 
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the winning strategy we described for Player 1 for the mean-payoff game 

Player 1 can avoid cycles in by play- 

she can avoid cycles in by playing the 


over the graph of Figure 3 


ing the self-loop on vr, anc. 
self-loop on vr. 

Moving from one stage to another, and hence switching between such 
counteracting strategies to force simple cycles in different half-spaces, cannot 
be done too often because as a result of switching from one strategy to 
another a bounded number of unfavourable simple cycles may be formed. 
This is the case in our example, since switching between vr and vr closes 
a cycle with effect (-1,-1) resulting in a drift away from the non-negative 
orthant. 


The strategy for Player 1 proposed by Chatterjee and Velner 


overcomes 

this complication by increasing the number of steps made in every stage; 
in particular, they proposed making s(m) = m steps in stage m before 
proceeding to stage m + 1. The purpose is to make the drift grow slower 
than the number of steps in the play. This, as can be deduced from their 
analysis, gives a bound of 0(n 3 / 4 ) for the norm of the energy-level vector 
after n steps, and hence the long-term average is 0 in all dimensions because 
lim n _ s . 0O r] -— = 0. One may observe that more generally, if we set s(m) == 
m e , for any e > 0, then the norm of the energy-level vector after n steps is 
Q( n i/ 2 +e/ 2 (i+£))_ H enc6) best upper bounds on the norm of the energy- 
level vectors after m steps that can be guaranteed by Player 1—when using a 
strategy similar to that constructed by Chatterjee and Velner—are in uj(y/n). 
Let us point out that such strategies require infinite memory because they 
need to ‘keep the count’ of the stage they are in and of the number of steps 
they need to perform in the current stage, both of which are unbounded. 


5. Perfect Half-Spaces 

We recall in this section the definition of subsets of Q d called perfect 
half-spaces, which can also be characte rised as th e maximal salient blunt 
cones in Q d . They will be used next in Section 6 to define a condition for 


Player 2 to win bounding games, which relies on Player 2’s ability to force 
cycles inside perfect half-spaces. This can be understood as a generalisation 
of Chatterjee and Velner’s approach for solving multi-dimensional mean- 
payoff games, which relies on a similar ability to force cycles inside open 
half-spaces. We employ perfect half-spaces in |Section 6 to colour the edges 
in first-cycle bounding games, which determine the winner using both the 
colours and the weight of the first cycle formed along a play. 


5.1. Definitions from Linear Algebra. Given a subset A of Q , we write 
span (A) (resp., cone(A)) for the vector space (resp., the cone ) generated by 
A, i.e., the closure of A under addition and under multiplication by all 
(resp., nonnegative) rationals. 

Observe that the sufficient condition for existence of a winning strategy 
for Player 2 in a lossy multi-dimensional mean-payoff game is also a sufficient 
condition for him to have a winning strategy in a bounding game. Unlike 
for multi-dimensional mean-payoff games and as witnessed with the game 


on Figure 3, however, this condition is not necessary. In order to formulate a 
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new more powerful sufficient condition, we use instead perfect half-spaces: a 
k-perfect half-space of Q d , where k E {1,2,, d }, is a (necessarily disjoint) 
union Hd U • • • U H & such that: 

• Hd is an open half-space of 

• for all j E { k , ... ,d — 1}, Hj C is an open half-space of the 
boundary of Hj + \. 

Whenever we write a A:-perfect half-space in form Hd U • • • U H &, we assume 
that each Hj is j-dimensional. We additionally define the (d + l)-perfect 
half-space as the empty set; a partially-perfect half-space is then a /c-perfect 
half-space for some k in {1,..., d+1}. A perfect half-space is a 1-perfect half¬ 
space. Observe that a partially-perfect half-space is always a cone, which 
is blunt , i.e., does not contain 0, and salient , i.e., if it contains a vector v 
then it does not contain its opposite —v. Moreover, a perfect half-space is 
a maximal blunt and salient cone. 


5.2. Generated Perfect Half-Spaces. In order to pursue effective and 
parsimonious strategy constructions, we consider perfect half-spaces gener¬ 
ated by particular sets of vectors, which will correspond to the total weights 
of simple cycles in multi-weighted game graphs. Given a norm M in N, 
we say that an open half-space H is M-generated if its boundary equals 
span(B) for some set B of vectors of norm at most M. By extension, a 
partially-perfect half-space is M-generated if each of its open half-spaces is 
M-generated. 

Proposition 5.1. Any M-generated k-dimensional vector space of Q d has 
at most C{k) = 2(2 M + l)^* *- 1 ) open half-spaces that are M-generated. 


Example 5.2. In the game graph of Figure 3, there are three 1-generated 


open half-spaces of interest: the half-plane H 2 l = {(x,y) : x + y < 0} with 
boundary span((— 1,1), (1, —1)) and containing (—1, —1), and the two half- 

lines H\ '= {(x\ y) : x + y = 0 A x < 0} and H['= {(x,y) : x + y = 0 A x > 0} 
with boundary span(0) and containing respectively (—1,1) and (1,-1). In 
turn, those three open half-spaces define two perfect half-spaces: H 2 U H\ 
and H 2 U H [. □ 


5.3. Hierarchy of Perfect Half-Spaces. Finally, we fix a ranked tree-like 
structure on all M-generated partially-perfect half-spaces, which provide a 
scaffolding on which we will build strategies in multi-dimensional bounding 
games. Observe that an M-generated partially-perfect half-space Hd U • • • U 
Hk for k > 1 can be extended using any of the M -generated open half-spaces 
H of the boundary of H note that this boundary then equals span(M). 
In Example 5.2[ H 2 can be extended using H\ or H[, and span(Mi) = 

span(Mj) = {(x, y) : x + y = 0}. 

The set of M -generated perfect half-spaces can be totally ordered by 
positing a linear ordering < between all M --generated open half-spaces. We 
write -< for the lexicographically induced linear ordering between all M- 
generated perfect half-spaces of Q d : if H = Hd U • • • U H\ and H' = H' d U 

• ■ ■ U H[, we define % -< H' to hold iff Hj = Hj for all j E {k + 1,..., d} and 
Hk < H' k for some A; E {1, 2,..., d}. 
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6. First Cycle Bounding Games 


We define in this section first-cycle bounding games , which provide the key 
technical arguments for most of our results. Such games end as soon as a 
cycle is formed along a play, and the weight of this cycle determines the win¬ 
ner, along with a colouring information chosen by Player 2. In sections 6.2 


and 6.3, we are going to show that first-cycle bounding games and infinite 
bounding games are equivalent, by translating winning strategies for each 
Player p, p e {2,1}, from first-cycle bounding games to bounding games. 
This yields in particular the small hypercube property of Lemma 3.1| 


6.1. Definition. We define the first-cycle bounding game G(V,E,d ) on a 
multi-weighted game graph ( V,E,d ): 

• at any Player-1 vertex, Player 2 chooses a | V| • || E \\-generated perfect 
half-space % of Q d , and then Player 1 chooses an outgoing edge, 
whose occurrence in the play becomes coloured by 'H\ 

• at any Player-2 vertex, he chooses an outgoing edge; 

• the game finishes as soon as a vertex is visited twice, which produces 
a simple cycle C with coloured Player-1 edges; 

• Player 2 wins if w(C), the total weight of the cycle, is in the largest 
partially-perfect half-space of Q d that is contained in all the colours 
in C, or in other words the least common ancestor of all the colours 
in C- 

• Player 1 wins otherwise. 


Example 6.1. Player 2 wins the first-cycle bounding game played on Figure 3 


Indeed, he can choose the colour H -2 U H\ in vl and the colour H -2 U H[ in 
vr. Then Player 1 cannot avoid forming a simple cycle in either H 2 U H\ (if 
cycling on vl), in H 2 U H\ (if cycling on vr), or in H 2 (if cycling between 
vr and vr). □ 


Observe that first-cycle bounding games are finite perfect information games, 
and are thus determined , i.e. Player 1 wins from a vertex iff Player 2 loses 
from the same vertex and vice versa. 


6.2. Winning Strategies for Player 2. Player 2 might not have the 
power to choose a perfect half-space and then force all simple cycles formed 
to be in that perfect half-space; the bounding game on Figure 3 is a glaring 
example of this phenomenon. Nevertheless, Player 2 can very nearly achieve 
this: if he has a winning strategy, then he can win by colouring every edge 
taken in a play with a perfect half-space, and by forcing every simple cycle 
formed to be in the largest partially-perfect half-space that is contained in 
all the colours of edge occurring in the cycle. 

Suppose a is a strategy of Player 2 from a vertex vq in a first-cycle bound¬ 
ing game G(V, E, d) . Let a be the following strategy of Player 2 in the infinite 
bounding game T(V,E,d): 


• at any Player-2 vertex, a chooses the edge specified by a; 

• whenever a cycle is formed, a cuts it out of its memory, and continues 
playing according to a. 
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\f>n\ 



Figure 4. Lengths of the simple paths p n in the proof of 
Lemma 6.2 


Lemma 6.2. If a is winning for Player 2 in G(V,E,d) from some vertex 
vo, then a is winning for Player 2 in r(V, E, d) from the same vertex vq. 


Proof. Consider any infinite play p consistent with a, and let: 

• p be obtained from p by colouring all Player l’s edges with the 
\V\ ■ 11FI11-generated perfect half-spaces of Q d as specified by a\ 

• Ci, C 2 ,... be the cycle decomposition of p, and for each n, p n be the 
simple path that remains after removing C n ; 

• Pn be the largest partially-perfect half-space of Q d that is contained 
in all the colours in C n , for each n. 


Along the infinite play p, the prefixes p n might get shorter or longer but 
are always of length bounded by |V|. Those lengths are traced in blue in 


Figure 4 We let I be the minimal such length that occurs infinitely often. 


Let us call a partially-perfect half-space that occurs infinitely often in the 
sequence P\,Pi ,••• a recurring one. We want to show that, among the 
recurring partially-perfect half-spaces, there is one that is contained in all 
the others; the subsequent |Claim 6.3 will then allow to conclude. 

First observe that, for any n < n! such that \p n \ > \p n /\ and, for all 
m £ {n + 1,..., n! — 1}, \p n \ < \p m \, we have that either P n contains P n > or 
vice-versa (since \p n \ is bounded by |F| this situation must occur infinitely 
often). Indeed, in this case C n starts with (and p n ends with) a vertex 
v\ that occurs in C n >. Consider the sequence of vertices v±,... ,Vk-i,Vk 
visited along the cycle C n until the first Player 1 vertex v & (recall that by 
assumption all the simple cycles must visit some Player 1 vertex). Because 
in step n! Player 2 plays according to < 7 , he will choose the same actions in 
Player 2 vertices vi,... ,Vk-i as in step n. Hence Vk is also visited inside 
C n r, and Player 2 uses the same edge colouring at steps n and n' in v^. Thus 
the two cycles C n and C n ‘ share a colour, and the corresponding partially 
perfect half-spaces P n and P n > must be comparable for inclusion. 

Assume now that there are two incomparable (for inclusion) recurring 
partially perfect half-spaces P) and P $ and let us show that some partially 
perfect half-space P C P\Pf must also be recurring. Consider the infinite 
suffix of the play where l is the minimal observed length. By the previous 
observation, between any two occurrences of P 1 and P} in this suffix, we can 
find a sequence P 1 = P n , Pi 2 ,..., Pi h = P} of comparable partially perfect 
half-spaces connecting the two, i.e., with P l;i C Pi j+1 or P- Lj D Pi j+1 for all 
j. This is because there will be a later occurrence of a simple path pi m of 
length I < mind/jjJ, \pi k \) for some i m > h,ik ; see the thick dashed violet 
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Figure 4 Then some H C W ,H^ occurs among those Hi 


,U 


^k ’ 


line in 

Since there are infinitely many such pairs of occurrences of H) and H} but 
hnitely many \V\ ■ ||Fi|| -generated partially-perfect half-spaces, there must 
be infinitely many occurrences of one such H. 

Applying the previous reasoning to every pair of recurring partially perfect 
half-spaces H t and H}, we see that there must be a recurring partially perfect 
half-space that is contained in all the others. We conclude the proof using 
the following Claim 6.3. □ 


Claim 6.3. Suppose H = H c i U • • • U Hf is a partially-perfect half-space of 

Q d and ai, a 2 ,... is an infinite sequence of vectors such that: 

• the set {ai, a 2 ,...} is finite; 

• for each n, there exists a partially-perfect half-space of Q d that con¬ 
tains H and a n ; 

• we have a„ E H for infinitely many n. 

Then there exist k! E {d ,..., k} and N > 0 such that 

• for each n > N, + • • • + a n belongs to Hy the topological closure 
of Hy, and 

• the set of all distances of bln + • • • + a n from the boundary of Hy is 
unbounded. 

In particular, the set of all norms ||ai + • • • + a n || is unbounded. 


Proof. We have that H is of the form H ^ U • • • U H^. Let k' E {d ,..., k} be 
maximal such that a„ E Hy for infinitely many n. Observe that {1, 2,...}, 
the set of all positive integers, can be partitioned into three: 

(1) The set of all n such that a n E H d U • • • U Hy + \, which is finite 
by definition of k'. We let N be larger than the index of the last 
such a ra ; then the vectors a n for n> N belong to Hy. 

(2) The set of all n such that a n E Hy, which is infinite by definition 
of kl. Since Hy is open and the set {ai,a 2 ,...} is finite, there is 
a positive minimal distance of those a n from the boundary of Hy. 
These vectors bring the sums a at + • • ■ + a„ for > N unboundedly 
far from the boundary of Hy. 

(3) The set of all n such that a n is contained in the boundary of Hy. 

These vectors have no effect on the distance between the sums ajv + 
• • • + a n for n > N and the boundary of Hy. □ 


6.3. Winning Strategies for Player 1. If there is no winning strategy for 
Player 2 in the first-cycle bounding game G(V, E, d) from a vertex vq, i.e. no 
colouring by perfect half spaces that allows him to win, then by determinacy 
of first-cycle bounding games, there is a winning strategy a for Player 1 in 
G(V,E,d ) from vq against all possible colourings. This is our main technical 
result. It consists in constructing a finite-memory winning strategy a for 
Player 1 in the infinite bounding game T(V,E,d) from the same vertex vq, 
which balances her various ‘perfect half-space avoidance strategies’ in order 


to ensure the small hypercube property stated in Lemma 3.1 
The memory of a consists of: 
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a simple path: 7 from the initial vertex vo to the current vertex v, 
in which Player l’s edges are coloured by \V\ ■ ||E|| -generated perfect 
half-spaces of 

a colour: i.e. a \V\ • ||-EH-generated perfect half-space PL = Hd U- • • UH\ 
of Q d ] 

counters: c (k,W) for every k G {1,2, and for every nonzero 

total weight W of a simple cycle, which are natural numbers. 

Strategy tf copies its moves from strategy a for the first-cycle bounding 
game, based on the coloured simple path and the colour it has in its memory. 

6.3.1. Simple Path Updates. The simple path is maintained in a straight¬ 
forward manner: whenever a cycle is formed it is removed from the simple 
path; otherwise the current vertex is added to the simple path. 

6.3.2. Colour Updates. The perfect half-space Pi in the memory is used to 
avoid cycles with weights in PL, relying for this on the strategy a. It is ini¬ 
tially the -(-minimal \V\ ■ ||E||-generated perfect half-space. During the play, 
it is either left unchanged, or updated if the current energy level threatens 
to become unbounded. These updates can take two forms: 

• a k-shift to H' k > moves to the -(-minimal perfect half-space of 
the form Hd U • • • U H^+i U H' k U • • • U H[, and 

• a k-cancellation moves to the -(-minimal perfect half-space of the 
form Hd U • • • U Hk +1 U H' k U • • • U H[. 

The colour updates define the main phases of the strategy a. A k-event is 
either a fc-shift or a fc-cancellation. By a k-month we mean a maximal period 
with only <&:-events. By a k-year we mean a maximal period with only <k- 
cancellations and <Eshifts. This hierarchy of k-events mirrors in some sense 
the hierarchical structure of \V\ ■ ||EH-generated perfect half-spaces. 

6.3.3. Counters Updates. The purpose of the counters is to determine when 

to perform the fc-events, and to choose the appropriate H ' k s for ^-shifts. 
Together with the current path, they also provide the current energy level, 
which equals w( 7 ) + c (^> W) ' W throughout the play, where W ranges 

over all cycle weights, and help the strategy contain this energy within a 
small hypercube. The strategy thus aims at bounding these counters. 

Initially, all the counters c(k, W) have value 0. Each such counter keeps 
track of how many simple cycles with weight W have been formed since the 
latest >k-eve nt, where it was reset to 0. The behaviour in presence of <k- 
shifts and </c-cancellations is very transparent: their values are incremented 
each time a cycle of weight IT 7 ^ 0 is formed, and they are never decreased 
during a k- year. As we are going to see, the /c-cancellation operation breaks 
down the straightforward link between the counter values and numbers of 
occurrences of cycle weights, by subtracting positive numbers from counters. 
This preserves the invariant on the current energy level: counter decreases 
performed in a ^-cancellation always correspond to removing a (non-simple) 
cycle whose total weight is 0. 

Let us define for all 1 < k < d the k-soft bound 

U{k) = (4|T| • ||£||) 2fc(d+2)2 . 


( 10 ) 
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We say that the A;-soft bound holds for a cycle weight W if c(k, W) < U(k). 
Whenever the fc-soft bound fails for a cycle weight W in == span {Hk)\Hk, 
the strategy a should avoid any further increases, and attempts to perform a 
fc-shift to some H' k . The aim is for every cycle weight W with c(k, W) > U{k) 


to belong to H' k . 

If no such H' k exists, then, by standard results in integer linear algebra 


see 


Appendix A), all the counters c(k',W) for k! >k s.t. c(k,W) > U(k ) can 


be cancelled out by subtracting an appropriate multiple of a small positive 
integral solution of a system of linear equations, so that the /c-soft upper 
bounds are re-established. 

This strategy shows a statement dual to |Lemma 6.2 , and thereby entails 
both the equivalence of infinite bounding games with first-cycle bounding 
games and the small hypercube property of[L enirna 3Tj 


Lemma 6.4. If a is winning for Player 1 in G(V,E,d ) from some vertex 
vo, then a is winning for Player 1 inY(V,W, d) from vq , and ensures energy 
levels of norm at most (4|V| ■ ||_E||) 2 ( d + 2 ) 3 . 


6.3.4. a Summarised. Let us first summarise the definition of a. At any 
Player-1 vertex, a chooses the edge that a specifies for history 7 and perfect 
half-space H. After any move that leads to a vertex not occurring in 7 , 
the memory of a is updated only by extending 7 . Otherwise, a cycle C is 
formed, and the memory is updated as follows: 


Cycle C is cut out of 7 . For all k € {1,2, , d}, counters c(k, w(C)) 
are incremented, unless w(C) = 0. 

If all soft upper bounds hold, that is if for all k £ {1,2,..., d} and 
all simple-cycle weights W £ H/. we have c(k,W) < U{k), then the 
memory update is finished. 

Otherwise, let k £ { 1 , 2 ,..., d} be the largest for which the k -soft 
upper bound c (k, W) < U(k ) fails for some W £ H 
(fc-shift) If there is a |V|-||.E||-generated open half-space H of span (Hk) 
such that the fc-soft upper bound holds for all simple-cycle weights 
W £ H, then denoting by H' k the <-minimal such H , H is replaced 
by the -(-minimal perfect half-space of form Hj U • • • U iLfc+i U H' k U 
• • • U H[. All counters c (k', W), where k' £ {1, 2,..., k — 1} and W 
is a simple-cycle weight, are reset to 0 . 

(fc-cancellation) Otherwise, let W\, W 2 , ■ ■ ■, W n be all the non-zero 
simple-cycle weights in span (H^) that fail the fe-soft upper bound, 
and let A be the matrix whose columns are the vectors Wi, W 2 , ■ ■ ■, W n . 
Then by duality and existence of small positive integer solutions of 


systems of linear equations (Lemma A.2 Lemma A.4 and Proposi¬ 


tion 2.2), it follows that Ax = 0 has a solution in positive integers 

bounded by S(k) = f (2(|I/|-||£'||-|-l))( fc+2 ^. The perfect half-space Pi 
is replaced by the -(-minimal |V| • | A ||-generated perfect half-space 
of the form Hd U • • • U H^+i U H' k U • • • U H[ . Let 


u{k) == (4|V| • ||£||)( 2 ^-i)0+ 2 ) 2 


(11) 
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define the k-hard bound 


U{k) + u{k) . 


(12) 


For every Wj and every k! G {k, k + 1,..., d}, the value of c(k', Wi) 
is replaced by 

c(k', Wi) — u(k) • x(i) . (13) 


All counters c(k' , VF), where k! G {1,2,..., k — 1} and W is a simple- 
cycle weight, are reset to 0. A /^-cancellation is well-defined if all the 
differences in (13) are non-negative. 


We prove Lemma 6.4 through a sequence of claims. The first claim shows 
that /^-cancellations are always well-defined: 


Claim 6.5. Every k-cancellation is well-defined. 


Proof. We need to show that, just before the ^-cancellation, for every cycle 
weight Wi in span (H^) that fails the k -soft upper bound, and all k' > k , 
c(k',Wi ) > u(k ) ■ x(i). Indeed, 

c(k\ Wi) > c(k, Wi) by monotonicity of the counters for k, 

> U(k) since Wi fails the fe-soft upper bound, 

= (4|H| • ||£'||)( 2fe - 1 )( d + 2 )' (4|C| • ||F ;||)( d+2 ) 2 

> u(k) ■ S(k) since S(k) = (2 (|C| • \\E\\ + i))(^-t- 2 ) 2 , 

> u(k) ■ x(i) by|Lemma A.4| □ 


As explained before, the k- soft bound U{k) in (10) is employed by a to 
trigger a /c-event and a change of strategy to avoid cycles with weight inside 
some perfect half-spaces. However, this change of strategy might allow a few 
more instances of those cycles to be formed—but, crucially, no more than 
u[k ) further instances. The fc-hard bound in (12) is therefore enforced. 

This informal argument is proven formally in the following Claim 6431 If 
entails in particular that the d-hard bound is always enforced, since span {Hfi) 
is the whole space 


Claim 6 . 6 . For allk G {1,2,..., <i} and for all cycle weights W in span {H^), 

(1) (k-soft bound) at the beginning of every k-year, c(k, W) < U{k), and 

(2) (k-hard bound) c(k, W) < U{k) + u(k). 


Proof. We prove the two statements by nested induction, first on k and 
second on the sequence of k- years seen so far. 

Let us start with For the initial k- year, and for A;-years that begin 
just after a >fc-shift or a >/c-cancellation, since then c(k, W) = 0 , (jTj) holds 
trivially. We are left with the case of a k- year that begins just after a 
^-cancellation. We can assume using the secondary induction hypothesis 
that ([2]) holds at the end of the previous A;-year. Consider then some Wi in 
span (Fife) that fails the /c-soft upper bound just before that /^-cancellation. 
At that time, since span (iFfc) was not changed by the ^-cancellation, ([ 2 ]) 
applies and c (k,Wi) < U{k) + u(k). Therefore, at the beginning of the k- 
year, c (k, Wi) <U{k) + u{k) — u(k ) ■ x(i), and thus c(k, Wf) < U{k ) since 
x(i) > 0 . 
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By the secondary induction, it remains to establish ([ 2 ]) for every k- year 
such that ([I]) held at its beginning—this is the heart of the proof. Let 
Hi , 1 HN be the sequence of ^-dimensional open half-spaces consid¬ 

ered during the N /c-months spanned by the current k- year so far, where 
N < C{k) by Proposition 5.1 We know that all these open half-spaces de¬ 
fine the same vector space span (H\) = span(fl|) = • • • = span(iL^); let XV 
belong to that space. 

If XV satisfies the soft bound, there is nothing to be done. Otherwise, 
by the construction of a and the assumption of 0 at the beginning of the 
/c-year, there exists a first /c-month in this sequence, say the Lth for some 
1 < L < N, after which XV fails the soft bound onward. Then, during the 
/c-months 1,..., L, c (k, XV) <U(k), and for all n G {L + 1,..., N}, we know 
that w belongs to the closure Hf:. 

Consider the nth /c-month for n G {L + 1,..., N} in the current k- year; 
we want to bound the increase on c(/c, XV) during that /c-month. There are 
two cases: 


If XV G HJf: then the /c-dimensional space in the current colour is left 
unchanged during the nth /c-month. In turn, this means that no 
vertex of the game graph can be visited twice during that /c-month 
while forming a cycle of weight XV, as otherwise <r would allow to 
form a cycle with effect inside H^ U • • • U HJl and Player 1 would lose. 
Therefore, cycles with weight XV that are closed during the nth k- 
month can only be formed by consuming edges from the simple path 
at the beginning of the /c-month. Hence, c(/c, XV) can be increased 
by at most |V|. 

Otherwise: XV belongs to the boundary HV\Hff of Hf; and thus k > 1. 
In this case, during the nth /c-month, c(/c, XX) can only be increased 
by at most the maximal value of c(/c — 1 , W ) during the same k- 
month. This is because c(/c — 1, XX) is 0 at the beginning of the nth k- 
rnonth, and thereafter it can only decrease through </c-cancellations 
(or that /c-month would have ended), which decrease c(k,W) by 
the same value. By the main induction hypothesis for (JT]) with XV G 
span(iLfc-i) C (H^\Hjf), c(/c — 1 , W) is less than U(k — 1 ) + u(k — 1 ). 

We conclude that, during the current k- year, since N — L < C(k), 

• if k = 1, c(k,XV) —U(k) is less than C( 1) • \V\ = 2\V\ < u( 1), and 

• if k > 1 , 


c(/c, XX) — U{k) < C(k) ■ m.ax.(\V\,U(k — 1) + u(k — 1)) 

< C(k)-2 U(k-l) 

< (4|V| • ||£||) (d+2)2 /2 • 2(4|V| • H^ll) 2 ^- 1 )^ 2 ) 2 

= u(k) . □ 


Proof of Lemma 6.4 ■ By the previous claims, a is winning for Player 1 in the 


infinite bounding game, and thanks to the d-hard bound, it ensures that the 
norm of the current energy is bounded by || tu( , y) || + 'f2w(H(d) + u(d )) • ||W|| 
where XV ranges over the total weights of simple cycles in the game graph, 
and wfiy) is the weight of a simple path. Hence the norms || tu('y) || and 
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are bounded by (|V| • ||.E||) d . Finally, there are at most (2(|V| ||£7||) rf + l) d 
different total weights of simple cycles W. □ 


7. Concluding Remarks 


In this paper, we have shown in Theorem 3.3| and Theorem 3.4 that fixed¬ 
dimensional energy games can be solved in pseudo-polynomial time, regard¬ 
less of whether the initial credit is arbitrary or fixed. For the variant with 
given initial credit, this closes a large complexity gap between the TOWER 
upper bounds of Brazdil, Jancar, and Kucera [3] and the lower bounds of 
Courtois and Schmitz [Sj, and also settles the complexity of simulation prob¬ 
lems between VASS and finite state systems [5]: 


Corollary 7.1. The given initial credit problem for energy games is 2EXPTIME- 
complete, and EXPTIME -complete in fixed dimension d > 4. 


The main direction for extending these results is to consider a parity 
condition on top of the energy condition. Abdulla, Mayr, Sangnier, and 
Sproston [1] show that multi-dimensional energy parity games with given 
initial credit are decidable. They do not provide any complexity upper 
bounds—although one might be able to show TOWER upper bounds from 
the memory bounds on winning strategies shown by Chatterjee et al. 0 
Lemma 3]—, leaving a large complexity gap with 2EXPTIM E-hardness. This 
gap also impacts the complexity of weak simulation games between VASS 
and finite state systems [T] . 
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Appendix A. Linear Algebra 


An Alternatives Lemma. Given a norm M in N, we write for the set 
of integers {—M,..., M}. We say that a vector space, cone, or half-space 
in <Q d is Al-generated iff it can be generated by vectors in (Z^) d . We use 
Weyl's Theorem 0 


Theorem A.l (Weyl [T9] . Theorem 1). Any d-dimensional cone gener¬ 
ated by a finite set A in Q d is the intersection of a finite number of closed 
half-spaces, where the boundary of each half-space contains d — 1 linearly 
independent vectors from A. 


Lemma A.2. Suppose A C ( Z^-) d is contained in an M-generated subspace 
S ofQ d . Either A is contained in some M-generated closed half-space of S, 
or X^aeA^2>o a contains the zero vector. 


Proof. If the cone(A) is not the whole space span(A), then by Theorem A.l 


it is contained in an M-generated closed half-space H of span(A). Since S is 
M-generated, it is easy to obtain from H an M-generated closed half-space 
of S that contains cone(A) D A. 


l‘In these days the angel of topology and the devil of abstract algebra fight for the 
soul of each individual mathematical domain.’ [T8j 
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Otherwise, if cone(A) is the whole space span (A), it contains in particular 
the vectors — X)aeA a (from span(A)) and (from cone(A)), and thus 

SaeA ( Q>o a contains the zero vector. □ 


Small Solutions. We also use a lemma that bounds the positive integral 
solutions on systems of linear equations. The lemma is a corollary of the 
following result of von zur Gathen and Sieveking m- 

Theorem A. 3 (von zur Gathen and Sieveking [46]). Let A, b, C, d be 
m x n-, m x 1-, p x n-, p X 1-matrices respectively with integer entries. The 
rank of A is r, and s is the rank of the {m + p) x n-matrix (c) • Let M be 
an upper bound on the absolute values of those (s — 1) x (s — 1)- or s x s- 
subdeterminants of the [m + p) x (n + 1)-matrix (c d)? which are formed 
with at least r rows from (A,b). If Ax = b and Cx > d have a common 
integer solution, then they have one with coefficients bounded by (n + 1 )M. 


Lemma A.4 (Small Solutions Lemma). Suppose A is a d x n-matrix with 
entries from and mutually distinct columns. If Ax = 0 has a solution 
in positive rationals, then it has a solution in positive integers bounded by 
(2(M + l))( r+2 ) 2 , where r is the rank of A. 


Proof. We can assume that d = r. Apply Theorem A.3 with b the d- 
dimensional zero vector, C the n-dimensional identity matrix and d the 
n-dimensional vector of ones. The absolute value of any subdeterminant of 
(mi) is a t most n d+l M d . Since n < (2 M + l) d , we have that 


(n + 1 )n d+1 M d < 2(2(M + l)) d (2(M + 1 ))d{d+i) M d = 

2 ( d+1)2 (M + l) d(d+3) < ( 2 (M + l)) (d+2)2 . □ 
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